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Abstract 

In this paper, we consider the following graph partitioning 
problem: The input is an undirected graph G = (V, E), a 
balance parameter b e (0, 1 /2] and a target conductance value 
y e (0, 1). The output is a cut which, if non-empty, is of 
conductance at most 0(f), for some function f(G,y), and 
which is either balanced or well correlated with all cuts of 
conductance at most y. In a seminal paper, Spielman and Teng 

|16| gave an 0(|£]/y 2 )-time algorithm for / = 
and used it to decompose graphs into a collection of near- 
expanders lfl8l . 

We present a new spectral algorithm for this problem 
which runs in time 0(\E\/y) for / = Jy. Our result yields 
the first nearly-linear time algorithm for the classic Balanced 
Separator problem that achieves the asymptotically optimal 
approximation guarantee for spectral methods. 

Our method has the advantage of being conceptually 
simple and relies on a primal-dual semidefinite-programming 
(SDP) approach. We first consider a natural SDP relaxation 
for the Balanced Separator problem. While it is easy to 
obtain from this SDP a certificate of the fact that the graph 
has no balanced cut of conductance less than y, somewhat 
surprisingly, we can obtain a certificate for the stronger 
correlation condition. This is achieved via a novel separation 
oracle for our SDP and by appealing to Arora and Kale's [3 1 
framework to bound the running time. Our result contains 
technical ingredients that may be of independent interest. 

1 Introduction 

1.1 Graph Partitioning. Given a graph G = (V,E), the 

def - _ 

conductance of a cut (S, S) is <f>(S) - l £ ( s . s )l/min{voi(S),voi(S)), 
where vol(5 ) is the sum of the degrees of the vertices in the set 
S. A cut (S,S) is ^-balanced if min{vol(5), vol(S)} ^b-volV. 
A graph partitioning problem of widespread interest is the 
Balanced Separator problem: given G = (V,E), a constant 1 
balance parameter b e (0, and a conductance value y e 
(0, 1), does G have a ^-balanced cut S such that <p(S) < y? 
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Balanced Separator is an intensely studied problem in 
both theory and practice. It has far-reaching connections to 
spectral graph theory, the study of random walks and metric 
embeddings. Besides being a theoretically rich problem, 
Balanced Separator is of great practical importance, as it 
plays a central role in the design of recursive algorithms, 
image segmentation and clustering. 

Since Balanced Separator is an NP-hard problem [6|, 
we seek approximation algorithms that either output a cut 
of conductance at-most f(y, log | V|) and balance Q.i,(l) or a 
certificate that G has no ^-balanced cut of conductance at most 
y. In their seminal series of papers lfT6l[T8lfi7ll . Spielman and 
Teng use an approximation algorithm for Balanced Separator 
as a fundamental primitive to decompose the instance graph 
into a collection of near-expanders. This decomposition is 
then used to construct spectral sparsifiers and solve systems 
of linear equations in nearly linear time. Their algorithm 
has two crucial features: first, it runs in nearly linear time; 
second, in the case that no balanced cut exists in the graph, it 
outputs a certificate of a special form. This certificate consists 
of an unbalanced cut of small conductance which is well- 
correlated with all low-conductance cuts in the graph. We 



prove in Section A. 1 in the Appendix that such a cut is indeed 



a negative certificate for the Balanced Separator problem. 
Formally, they prove the following: 

Theorem 1.1. M6V Given a graph G, a balance parameter 
b e (0, 72], b — Q(l) and a conductance value y e (0, 1), 
Partition(G, b, y) runs in time t and outputs a cut S C V such 
that vol(S) < 7 /8 • vol(G), <f>(S) < fi or S = 0, and with high 
probability, either 

1. S is Q,b(l)-balanced, or 

2. for all C c V such that vol(C) < V 2 ■ vol(G) and 

m < o(y), > 1/2. 

Originally, Spielman and Teng showed Theorem |1.1| with 

fi — O I yjy log 3 nj and t - O ( m /y 2 ) . This was subsequently 

improved by Andersen, Chung and Lang [ 1 1 and then by An- 
dersen and Peres [2l to the current best of f\ = O ( -\Jy log n) 

and t = G( m /vr)- All these results made use of bounds on the 
convergence of random walk processes on the instance graph, 
such as the Lovasz-Simonovits bounds [ 1 3 1 . These bounds 



yield the log n factor in the approximation guarantee, which 
appears hard to remove while following this approach. 

1.2 Our Contribution In this paper, we use a semidefinite 
programming approach to design a new spectral algorithm, 
called BalCut, that improves on the result of Theorem |1.1| 
The following is our main result. 

Theorem 1.2. (Main Theorem) Given a graph G — (V,E), a 
balance parameter b e (0, b — £1(1), and a conductance 
value y e (0,1), BalCut(G, b, y) runs in time 0( m /y) and 
outputs a cut S c V such that vol(.S') < 7 / 8 ' vol(G), if S + 
then<p(S) < 6>i( Vr) , and with high probability, either 

1. S is Q,i,(l)-balanced, or 

2. for all C c V such that vol(C) < V 2 • vol(G) and 
<P(C) < 0(y), > 1/2- 

Note that our result improves the parameters of previous 
algorithms by eliminating the log n factor in the quality of the 
cut output, making the approximation comparable to the best 
that can be hoped for using spectral methods [7 1. Our result is 
also conceptually simple: we use the primal-dual framework 
of Arora and Kale to solve SDPs combinatorially, and 



we give a new separation oracle that yields Theorem 1.2 



Finally, our result implies an approximation algorithm for 
Balanced Separator, as the guarantee of Theorem 1 1 . 2 1 on the 
cut S output by BalCut also implies a lower bound on the 
conductance of balanced cuts of G. The proof can be found in 



Section A. 1 in the Appendix. 



Corollary 1.1. Given an instance graph G, a balance pa- 
rameter b € (0, 72] and a target conductance ye (0,1], Bal- 
Cut (G, b, y) either outputs an Q,i,(l)-balanced cut of conduc- 
tance at most Ob( y/y) or a certificate that all Q.b(Y)-balanced 
cuts have conductance at least €lb(y). The running time of the 
algorithm is O(mjy). 

This is the first nearly-linear-time spectral algorithm for Bal- 
anced Separator that achieves the asymptotically optimal ap- 
proximation guarantee for spectral methods. 

1.3 Graph Decomposition. The main application of The- 
orem 1 1 . 1 1 is the construction of a particular kind of graph 
decomposition. In this decomposition, we wish to parti- 
tion the vertex set of the instance graph V into components 
V\, . . . ,Vi, . . . ,Vk such that the graph induced by G on each V, 
has conductance as large as possible, while at most a constant 
fraction of the edges have endpoints in different components. 
These decompositions are a useful algorithmic tool in several 
areas HHEEUEDD. 

Kannan, Vempala and Vetta [10] construct such decom- 
positions achieving a conductance value of Q(7iog 2 «). How- 
ever, their algorithm runs in time 0(m 2 ) on some instances. 

Spielman and Teng |[T8ll relax this notion of decomposi- 
tion by only requiring that each V, be contained in a superset 



Wj in G, where Wj has large induced conductance in G. In 
the same work, they show that this relaxed notion of decom- 
position suffices for the purposes of sparsification by random 
sampling. The advantage of this relaxation is that it is now 
possible to compute this decomposition in nearly-linear time 
by recursively applying the algorithm of Theorem 1 1.1 1 

Theorem 1.3. Ill 8V Assume the existence of an algorithm 
achieving parameters t and f\ in Theorem 1.1 Given y e 



(0, 1), in time 0(t), it is possible to construct a decomposi- 
tions of the instance graph G into components Vu . . . , Vj such 
that: 

1. for each V,-, there exists Wi 2 V, such that the conduc- 
tance of the graph induced by G on Wi is Q(y/iog«). 

2. the fraction of edges with endpoints in different compo- 
nents is 0(fi logn). 



Using Theorem 1.3 Spielman and Teng showed the exis- 



tence of a decomposition achieving conductance Q(7iog 6 n). 



Our improved results in Theorem 1.2 imply that we can 
obtain decompositions of the same kind with conductance 
bound Q(7iog 3 «). Our improvement also implies speed-ups 
in the sparsification procedure described by Spielman and 
Teng l!T8ll . However, this result has since been superceded 
by work of Koutis, Miller and Peng [ 12] that gives a very fast 
linear equation solver that can be used to compute sampling 
probabilities for each edge, yielding a spectral sparsifier with 
high probability 1 15 1. 

Our work leaves open the important question posed by 
Spielman 1 141 of whether stronger decompositions, of the 
kind proposed by Kannan, Vempala and Vetta iflOl . can be 
produced in nearly-linear time. 

1.4 Overview of Techniques 

Spectral Approach. The simplest algorithm for Bal- 
anced Separator, also used by Kannan et al. ifTOl . is the recur- 
sive spectral algorithm. This algorithm finds the minimum- 
conductance sweep cut of the second eigenvector of G, re- 
moves the cut and all adjacent edges from G, and reiterates 
on the remaining graph. The algorithm stops when the union 
of the cuts removed becomes ^-balanced or when the resid- 
ual graph is found to have spectral gap at least y, certifying 
that no more progress can be made. As every cut may only 
remove 0(1) volume and the eigenvector computation takes 
Q.(m) time, this algorithm may have quadratic running time. 
It can be shown using Cheeger's Inequality [5] that the cut this 
procedure outputs is of conductance at most 0( -Jy). 

Spielman-Teng Approach. The algorithm of Spielman 
and Teng which proves Theorem |l.l| is also spectral in nature 
and uses, as the main subroutine, local random walks that run 
in time proportional to the volume of the output cut to find 
sparse cuts around vertices of the graphs. These local methods 
are based on non-trivial random walks on the input graph and 
aggregation of the information obtained from these walks, all 
performed while maintaining nearly-linear running time. 



Our Approach. We depart from the random-walk 
paradigm and first consider a natural SDP relaxation for the 
Balanced Separator problem, which BalCut solves approxi- 
mately using a primal-dual method. Intuitively, BalCut man- 
ages to maintain the approximation guarantee of the recur- 
sive spectral algorithm while running in nearly-linear time by 
considering a distribution over eigenvectors, represented as a 
vector embedding of the vertices, rather than a single eigen- 
vector, at each iteration. The sweep cut over the eigenvec- 
tor is replaced by a sweep cut over the radius of the vectors 
in the embedding (see Figure [T}. Moreover, at any iteration, 
rather than removing the unbalanced cut found, BalCut pe- 
nalizes it by modifying the graph so that it is unlikely but still 
possible for it to turn up in future iterations. Hence, in both 
its cut-finding and cut-eliminating procedures, BalCut tends 
to "hedge its bets" more than the greedy recursive spectral 
method. This hedging, which ultimately allows BalCut to 
achieve its faster running time, is implicit in the primal-dual 
framework of Arora and Kale [3]. 
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Figure 1: Schematic representation of the speed-up intro- 
duced by BalCut when the instance graph contains many 
unbalanced cuts of low conductance. Let v\ and U2 be the 
two slowest-mixing eigenvectors of G. Assume that their 
minimum-conductance sweep cuts S i and S 2 are unbalanced 
cuts of conductance less than y. If we use the recursive algo- 
rithm of Kannan et al. IfTOl . two iterations could be required 
to remove S 1 and S 2. However, BalCut considers a multidi- 
mensional embedding containing contributions from multiple 
eigenvectors and performs a radial sweep cut. This allows S 1 
and S 2 to be removed in a single iteration. 



The SDP relaxation appears in Figure 1.4 We denote by 

def 

// : V 1— > Rj-o the distribution defined as yu,- = */vqI(G), and 

def 

by di the degree of the /-th vertex. Also, v a „ q = /jjU,. Even 
though our algorithm uses the SDP, at the core, it is spectral 
in nature, as it relies on the matrix-vector multiplication 
primitive. Hence, if one delves deeper, a random walk 
interpretation can be derived for our algorithm. 
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Figure 2: SDP for Z?-Balanced Separator 



The Primal-Dual Framework. For our SDP, the 
method of Arora and Kale can be understood as a game be- 
tween two players: an embedding player and an oracle player. 
The embedding player, in every round of this game, gives a 
candidate vector embedding of the vertices of the instance 
graph to the oracle player. We show that, if we are lucky 
and the embedding is feasible for the SDP and, in addition, 
also has the property that for a large set S, for every i e S, 



u avg 



I2 < (we call such an embedding round- 

able), then a projection of the vectors along a random direc- 
tion followed by a sweep cut gives an 0^,(1 )-balanced cut of 
conductance at most 0( y[y). The difficult case is when the 
embedding given to the oracle player is not roundable. In this 
case, the oracle outputs a candidate dual solution along with a 
cut. The oracle obtains this cut by performing a radial sweep 
cut of the vectors given by the embedding player. If at any 
point in this game the union of cuts output by the oracle be- 
comes balanced, we output this union and stop. We show that 
such a cut is of conductance at most Obi yjy)- If this union 
of cuts is not balanced, then the embedding player uses the 
dual solution output by the oracle to update the embedding. 
Finally, the matrix-exponential update rule ensures that this 
game cannot keep on going for more that O( lo s"//y) rounds. 
Hence, if a balanced cut is not found after this many rounds, 
we certify that the graph does not contain any ^-balanced cut 
of conductance less than y. To achieve a nearly-linear running 
time, we maintain only a log n-dimensional sketch of the em- 
bedding. The guarantee on the running time then follows by 
noticing that, in each iteration, the most expensive computa- 
tional step for each player is a logarithmic number of matrix- 
vector multiplications, which takes at most 0(m) time. 

The reason why our approach yields the desired corre- 
lation condition in Theorem [L2] is that, if no balanced cut is 
found, every unbalanced cut of conductance lower than y will, 
at some iteration, have a lot of its vertices mapped to vectors 
of large radius. At that iteration, the cut output by the oracle 
player will have a large correlation with the target cut, which 
implies that the union of cuts output by the oracle player will 
also display such large correlation. This intuition is formal- 



ized in the proof of Theorem 1 .2 

Our Contribution. The implementation of the oracle 
player, specifically dealing with the case when the embedding 
is not roundable, is the main technical novelty of the paper. 
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Studying the problem in the SDP-framework is the main 
conceptual novelty. The main advantage of using SDPs to 
design a spectral algorithm seems to be that SDP solutions 
provide a simple representation for possibly complex random- 
walk objects. Furthermore, the benefits of using a carefully 
designed SDP formulation can often be reaped with little or 
no burden on the running time of the algorithm, thanks to the 
primal-dual framework of Arora and Kale Q. 



1.5 Rest of the Paper In Section 2.1 we set the notation 
for the paper. In Section 2.2 we present our SDP and its 



dual, and also define the notion of a roundable embedding. In 



Section 2.3 we present the algorithm BalCut and the separa- 
tion oracle Oracle, and reduce the task of proving Theorem 
1.2| to proving statements about the Oracle. Section [3] con- 
tains the proof of the main theorem about the Oracle used 



in Section 2.3 For clarity of presentation, several proofs are 



omitted from the above sections and appear in the appendix. 

2 Algorithm Statement and Main Theorems 
2.1 Notation 

Instance graph and edge volume. We denote by G = 
(V,E) the unweighted instance graph, where |V] = n and 
\E\ - m. We let d e R y , be the degree vector of G, i.e. 
di is the degree of vertex i. We mostly work with the edge 

def def 

measure fi over V, defined as fij - fi(f) = */2m. For a subset 
S c V, we also define fis as the edge measure over S, i.e. 

. „ def . .. . 

A*s(0 = M/tiS). 

Special graphs For a subset S c V, we denote by Ks the 
complete graph over S such that edge {/, j} has weight fiifij 
for i, j e S and otherwise. K v is the complete graph with 
weight fijfi j between every pair i, j e V. 

Graph matrices. For an undirected graph H = (V, E H ), 
let A(H) denote the adjacency matrix of H and D(H) the diag- 
onal matrix of degrees of H. The (combinatorial) Laplacian 

of H is defined as L(H) d = D(H) - A(H). Note that for all 
x e R v , x T L(H)x = zZ{ij}eE H ( x i _ x jY ''■ By D and L, we de- 
note D(G) and L(G) respectively. 

Vector and matrix notation. For a symmetric matrix M, 
we will use M > to denote that it is positive semi-definite 
and M > to denote that it is positive definite. The expression 
A > B is equivalent to A - B > 0. For two matrices A, B of 

equal dimensions, denote A • B = f Tt(A t B) = Y^ij ^ij ' Bij- F° r 
a matrix A, we indicate by t A the time necessary to compute 
the matrix-vector multiplications Au for any vector u. 

Embedding notation. We will deal with vector embed- 
dings of G, where each vertex i e V is mapped to a vector 
Vj e W l . For such an embedding {u,}/ e y, we denote by u avg the 

mean vector, i.e. u avg =' Yiiev Given a vector embedding 
of {vj e Mr}i € v, recall that X > 0, is the Gram matrix of the 
embedding if X u = vjvj. For any X e R VxV ,X > 0, we call 
{vi}i(=v the embedding corresponding to X if X is the Gram ma- 
trix of {D;},- e y. For i G V, we denote by /?, the matrix such that 



RfX=\\ Vi -v m \\l 

Basic facts. We will alternatively use vector and matrix 
notation to reason about the graph embeddings. The follow- 
ing are some simple conversions between vectors and matrix 
forms and some basic geometric facts which follow immedi- 
ately from definitions. 

Fact 2.1. E^Jd,— y avg |g = l /2-E {i j hflXfl \\vi-Uj\\l = L(K V )»X. 

Fact 2.2. For a subsets c V, £ ieS fifii > fJ.(S)L(K v )-L(K s ). 

Fact 2.3. For a subset S QV, E^-j^x^ - vjW^ = 2 ■ V/XS) 2 • 
L(K S ).X. 

Modified matrix exponential update. Let S D e the 

subspace of R. v orthogonal to D = f VV5n • D'^l and let I 

be the identity over S D , i.e. J = f / - i/2m ■ D' /2 n T D' /2 . For a 
positive s and a symmetric matrix M e R VxV , we define 

def D-^-dn^-D-'r-MD-'r-p-^ 

U S (A) = 2m : : . 

J . e -{2m-e)D-^ 1 MD-'l 1 

The following fact about I will also be needed: 
Fact 2.4. I -2m- D- ,/2 L(K v )D- ,/2 . 

2.2 SDP Formulation We consider an SDP relaxation to 
the decision problem of determining whether the instance 
graph G has a ^-balanced cut of conductance at most y. 
The SDP feasibility program psdp(G, b, y) appears in Figure 
2.2 where we also rewrite the program in matrix notation, 



using Fact 2.1 and the definition of /?,-. psdp can be seen as 



psdp(G, b, y) : 



E Wvt-VjWl «4y 



E \\vj-v wq \\\ = 1 



Vl € V Ik - V, 



l-b 



i ^avgll2 



psdp(G,b,y) : - L*X <4y 
m 



L(K V ) • X = 1 



1 -b 

Vz € V R;»X < 

b 

Figure 3: SDP for Z?-Balanced Separator 

a scaled version of the balanced-cut SDP of |4|, modified 
by replacing u avg for the origin and removing the triangle- 
inequality constraints. The first change makes our psdp 
invariant under translation of the embeddings and makes the 
connection to spectral methods more explicit. Indeed, the 
first two constraints of psdpnow exactly correspond to the 
standard eigenvector problem, with the addition of the 



constraint ideally forcing all entries in the eigenvector not 
to be too far from the mean, just as it would be the case if 
the eigenvector exactly corresponded to a balanced cut. The 
removal of the triangle-inequality constraints causes psdp to 
only deal with the spectral structure of L and not to have a 
flow component. For the rest of the paper, denote by A the set 
{X e R VxV ,X > : L(K V ) • X = 1). 

The following simple lemma establishes that psdp is indeed a 
relaxation for the integral decision question and is proved in 
Section IA~2l 



A roundable embedding can be converted into a balanced 



Lemma 2.1. (SDP is a Relaxation) // there exists a b- 
balanced cut S with (f>(S ) < y, then psdp(G, b, y) has a 
feasible solution. 

BalCut will use the primal-dual approach of [ 3 1 to determine 
the feasibility of psdp(G,b,y). When psdpis infeasible, Bal- 
Cut will output a solution to the dual dsdp(G, b, y), shown in 
Figure |4] 



dsdp(G, b,y): a - J] # > Ay 



— ■L + YjPiRi-aLiKy)^^ 

ieV 



Figure 4: dsdp(G,b,y) feasibility problem 

In the rest of the paper, we are going to use the following 
shorthands for the dual constraints 



def L 



ieV 



2m 



ieV 



Notice that V(a,/3) is a scalar, while M(a,/3) is a matrix in 
R VxV . Given X > 0, a choice of (a, (3) such that V(a,{3) > 4y 
and M(a,f3) • X > corresponds to a hyperplane separating 
X from the feasible region of psdp(G,b,y) and constitutes a 
certificate that X is not feasible. 

Ideally, BalCut would produce a feasible solution to psdpand 
then round it to a balanced cut. However, as discussed in 0, 
it often suffices to find a solution "close" to feasible for the 
rounding procedure to apply. In the case of psdp, the concept 
of "closeness" is captured by the notion of roundable solution. 

Definition 2.1. (Roundable Embedding) Given an embedding 
{Vi} ieV , letR = {i € V : ||t); - v avg \\l < 32 • d-b)/b}. We say that 
{y/}, e y is a roundable solution to psdp(G, b,y) if: 

- E (i j} E £ - Vj\\\ < 2y, 

- Ej-p \\Vj ~ VavgWj = 1, 

- E^h^lh - vj\\l > V64. 



cut of the conductance required by Theorem 1.2 by using a 
standard projection rounding, which is a simple extension of 
an argument already appearing in J4] and 0. The rounding 



procedure ProjRound is described precisely in Section A. 4 
where the following theorem is proved. 

Theorem 2.1. (Rounding Roundable Embeddings) // {v, e 
JR^liey is d roundable solution to psdp(G, b, y), then 
PROjRouND({y,}, e y, b) produces a Q/,(l)- balanced cut of con- 
ductance Ob ( -\Zy) with high probability in time 0(nd + m). 

2.3 Primal-Dual Framework 

Separation Oracle. The problem of checking the fea- 
sibility of a SDP can be reduced to that of, given a candi- 
date solution X, to check whether it is close to feasible and, 
if not, provide a certificate of infeasibility in the form of a 
hyperplane separating X from the feasible set. The algorithm 
performing this computation is known as a separation oracle. 
Arora and Kale show that the original feasiblity problem can 
be solved very efficiently if there exists a separation oracle 
obeying a number of conditions. We introduce the concept 
of good separation oracle to capture these conditions for the 
program psdp(G,yS, y). 

Definition 2.2. (Good Separation Oracle) An algorithm is a 
good separation oracle if on input some representation ofX, 
the algorithm either finds X to be a roundable solution to 
psdp(G, b, y) or outputs coefficents a, [3 such that V(a,f3) > 
3/4-y, M(a,p)»X > i/64-y and -yL(K v ) < M{a,p) < 3L(K V ). 

Algorithmic Scheme. We adapt the techniques of [3| 
to our setting, where we require feasible solutions to be in 
A rather than having trace equal to 1. The argument is 
a simple modification of the anaylsis of (3| and in |[T9l . 
The algorithmic strategy of is to produce a sequence of 
candidate primal solutions X {1 \ . . . ,X (T) iteratively, such that 
X (,) e A for all t . 

Our starting point X w will be the solution lm jn-\ ■ D l . 
At every iteration, a good separation oracle Oracle will take 
X w and either guarantee that X (,) is roundable or output 
coefficents a w ,0'^ certifying the infeasiblity of X^'K The 
algorithm makes use of the information contained in a ( '\/3^ 
by updating the next candidate solution as follows: 

(2.1) = f s/6 ■ (M(cP,fiP) + yL(K v j) 



V (=1 



where s > is a parameter of the algorithm. The following is 
immediate. 

Lemma 2.2. For all t > 0, X {,) e A. 

Following [3 1, we prove that, after a small number of itera- 
tions this algorithm either yields a roundable embedding or a 
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Input: An instance graph G = (V, E), a balance value b e (0, 1/2] such that b - £2(1), a conductance value y e (0, 1). 
Lete= i/i30. For t= 1,2,..., T = oi^f) : 

- Compute the embedding {s[° } fev corresponding to X {,) = U £ [Z'j 7 Ji p(j) ) ■ If t = l.-^ = #s(0) = 2 "'/«-i • D" 1 - 

- Execute Oracle^G, b, y, {0®}; g yj . 

- If Oracle finds that {u®}j e y is roundable, run ProjRound^G, b, {5®};eyJ > output the resulting cut and terminate. 

- Otherwise, Oracle outputs coefficients (a (f) ,j8 w ) and cut B w . 

- Let C {,) d = U/=i fi (0 - If C {,) is ^-balanced, output C (,) and terminate. 

- Otherwise, let P (t) =' - £ /6 • (.M (aftKft'A + yL(K v fj and proceed to the next iteration. 
Output S = lXi B<0 - Also 0Ut P ut oe = 1 It Yj =l a {,) and ~B = i/r Yj=ift l) - 

Figure 5: The BalCut Algorithm 



feasible solution to dsdp(G, £>, O(y)). We present the proof in 



Section A. 3 for completeness. 



Theorem 2.2. (Iterations of Oracle, f3j) Let e = l l no. As- 
sume that the procedure Oracle is a good separation oracle . 
Then, after T — OQ°s"/y) iterations of the update of Equation 
2. 1 we either find a roundable solution to psdp(G, b, y) or the 
coeffcents a = l/r a< ~® and/3 = 1 /t ft® are a feasible 
solution to dsdp(G, b, 3 /i6 • y). 

Approximate Computation. Notice that, while we are 
seeking to construct a nearly-linear-time algorithm, we cannot 
hope to compute X w exactly and explicitly, as just maintain- 
ing the full X® matrix requires quadratic time in n. Instead, 
we settle for a approximation X (f+1) to which we define 
as 



v i=l 



The function U s is a randomized approximation to U s ob- 
tained by applying the Johnson-Linderstrauss dimension re- 
duction to the embedding corresponding to U s . U s is de- 
scribed in full in Section |A.5| where we also prove the fol- 
lowing lemma about the accuracy and sparsity of the approx- 
imation. It is essentially the same argument appearing in ||9) 
applied to our context. 



3. {di e R d ) ,- E y can be computed in time 0(t/^ + ri). 

4. for any graph H — ( V, Eh), with high probability 
(l-yM)-L(H)»X-T < L(H)»X < (l + i/64)-L(H)»X+T, 
and, for any vertex i G V, 

(1 - 1/64) ■R i »X-T<R i »X<(l + 1/64) ■ Ri • X + r, 

where t < G(Vpoiy(«)). 

This lemma shows that X w is a close approximation to X { '\ 
We will use this lemma to show that Oracle can receive X w 
as input, rath er than X^\ and still meet the conditions of 
In the rest of the paper, we assume that X w 



Theorem 



2.2 



is represented by its corresponding embedding [v] Uev 

The Oracle. Oracle is described in Figure [6] We s how 



2.2 



that Oracle on input meets the condition of Theorem 
Moreover, we show that Oracle obeys an additional condi- 
tion, which, combined with the dual guarantee of Theorem 
2.2 will yield the correlation property of BalCut. 



Theorem 2.3. (Main Theorem on Oracle) On input X^, Or- 
acle runs in time O(m) and is a good separation oracle for 
X®. Moreover, the cut B in Step^is guaranteed to exist. 

Proof of Main Theorem. We are now ready to prove 



Lemma 2.3. (Approximate Computation) Let s = 1/130. For a Theorem[L2j To show the overlap condition, we consider the 



clef ,~ r 



. def 



matrix M e R VxV , M > 0, let X =' U e (M) and X =' U E (M). 

1. X>0andXeA. 

2. The embedding {y;},- e y corresponding to X can be repre- 
sented in d — G(log n) dimensions. 



dual condition implied by Theorem 2.2 together with the cut 
Z? (,) and the values of the coefficents output by the Oracle. 

Proof. [Proof of Theorem |1.2| If at any iteration t, the em- 
bedding {u®} !€ y corresponding to X w is roundable, the stan- 
dard projection rounding ProjRound produces a cut of bal- 



1. Input The embedding {5,}, e y, corresponding to X e A. Let r, = - 5 a vgll2 for all i e V. Denote R = {i e V : rf < 

32 -a-*)/*}. 

2. Case 1: E ( y} E£ \\v t - Vj\\ 2 2 > 2y. Output a = y, /? = and B = 0. 

3. Case 2: not Case 1 and E^j^^Hi); - UjHj > 5. Then {«,•}, - e y is roundable, as X e A implies E^ = 1. 

4. Case 3: not Case 1 or 2. Relabel the vertices of V such that r\ > r 2 > . . . > r„ and let 5, = { 1, ...,/} be the y'-th sweep 
cut of r. Let z the smallest index such that p(S z ) > b /s. Let B the most balanced sweep cut among {S i, . . . , •SVi) such 
that <p{B) < 2048 • \fy. Output a = 7 /sy, B t -pry for / e B and = for i £ B. Also output the cut B. 

Figure 6: Oracle 



ance fi&(l) and conductance Obi y/y) by Theorem 2.1 Simi- 
larly, if for any f, C w is ^- balan ced, BalCut satisfies the bal- 

as 0(C W ) < 0( yjy) because 



1.2 



ance condition in Theorem 

C (t ' is the union of cuts of conductance at most 0{ \fy). 

Otherwise, after T = O( lo s"/y) iterations, by Theorem 



2.2 



we have that a = l /r2^i« (t) and P = V^Z^i/?® 
constitute a feasible solution dsdp(G, b, 3 /i6 ■ y). This implies 
that M(a,/3) > 0, i.e. 



(2.2) 



-L+y PiRi - aL(K v ) > 0. 



For any cut C such that fi(C) < l /i and 0(C) < r/i6, 
let the embedding {«,• e R}, E y be defined as M; = \Jv<&/y(C) 
for z e C and m/ = - yMO/^c) for z £ C. Then « avg = 
and Ej^Hw; - M avg ||^ = 1. Moreover, B {i j )€E \\Ui - Uj\\\ = 
i/ m . |£(c,c)|/ Al(C)/ , ( c) < 4 • 4>(C) < r/4. Let U be the Gram matrix 
of {ui e R}, w . 



We apply the lower bound of Equation 2.2 to U. By Facts 
IZD and 1231 

ZPi\\Ui - U WQ \\\ - a E \\u t - M aV gll2 

ieV 

= M{a,P) • U > 



E |Ih/-W/||, + 



Recall that, by the definition of Oracle, for all t e [T], 
a (t) > 78 • y and sf = Pi-y for i e fi w and Bf = for z £ B (t \ 
Hence, 

T 

y/4 + 7/t ■ V n C) ■ -"(%(C) + fi(B (,) n C) • Mcvko) 



-78 ■ y > 



Dividing by y and using the fact that p(C) < V 2 an d 
ju(C) < 1, we obtain 



i/ T 



y ( p(B ( '> n C) /j(fi (,) n C) 



> (7/g - 1/4) = 5/ 8 . 



Now, 

n(S n C) p(S n C) 

MQ + 2 ■ yU(C) 



n{B {,) n C) ^(B (,) n C) 



2-MQ 



so that we have 



n{S n o //(^ n C) 

+ — . -. ^ /o 



2 -MO 



Moreover, being the union of cuts of conductance 0( -Jy), S 
also has (p(S) < 0( y/y). As //(S) < */4, ^^/i-^C) < //(5) < 



| < V 8 - This finally implies that 



H(S n C) 

MO 



> V2- 



Finally, both ProjRound and Oracle run in time O(m) as 



the embedding is 0(\og ri) dimensional. By Lemma 2.3 the 



update at time t can be performed in time tu, where M = 
P^- This is a matrix of the form a-L+2, e y biRi+cL(Ky). 
The first two terms can be multiplied by a <9(logn) vectors 
in time 0(m), while the third term can be decomposed as 



L(Ky) - H^Rj by Fact 2.1 and can therefore be also 



multiplied in time 0(m). Hence, each iteration runs in time 
O(m), which shows that the total running time is 0( m /y) as 
required. 

3 Proof of Main Theorem on Oracle 

3.1 Preliminaries The following is a variant of the sweep 
cut argument of Cheeger's Inequality |5|, tailored to ensure 
that a constant fraction of the variance of the embedding is 
contained inside the output cut. For a vector x e R v , let 
supp(x) be the set of vertices where x is not zero. 

Lemma 3.1. Let x e R v ,x > 0, such that x T Lx < A and 
/i(supp(x)) < V 2 - Relabel the vertices so that x\ > xi > . . . > 
x z -\ > and x z — . . . — x„ — 0. For i e [z — 1], denote 
by S i c V, the sweep cut {1,2, ...,/}. Further, assume that 
YH=\ diX 2 t < 1, and, for some fixed k 6 [z - 1], Y!l=k ^i x j ^ °~- 
Then, there is a sweep cut S /, of x such that z — 1 > h > k and 
<P(S h ) < l/o-- V2l. 

We will also need the following simple fact. 

Fact 3.1. Given v, u, t e R d , (\\v - ?|| 2 - \\u - f|| 2 ) 2 < 



3.2 Proof of Theoreml231 

Proof. Notice that, by Markov's Inequality, /u(R) < 
b ld2-(i-b)) < h /\b. Recall that t = O (Vpoiy(n)) . 

- Case 1: E {i j )e£ \\v t - Vj\\\ = ± ■ L • X > 2y. We have 
V^/i) > y and, by Lemma [23] 



M(a,B) • X > (1 - 1/64) • 2y - y - r > 1/64 • y. 



Case 2: E/j * 



|!,;)~/iRX/j«ll y i 



roundable by Definition 2. 1 



"]»2 



> 1/64. Then {0;}j e y is 



- Cas e 3: E(,- w i~ Ut , x »Jlfy _ "jll^ < V 64 - This means that, by 
" L(K R ) • X < 1/2 ■ ^(7?) 2 • i/64 < i/i28. Hence, by 



Fact 
Fact 



23 
2.2 



128 



YjfrRi • * = Yj^i > ^ {R) - 1/128 > 1 ~ 1/32 ~ l i 

> 1 - 5/128 



ieR 



ieR 



We then have R = S g for some g e [n], where we also 
denote by S g the g largest coordinates dictated by the 
sweep cut S g . Let k < z be the the vertex in 7? such 
that Yf) , > (1 - Vi28) ■ (1 - 5/128) and Z% k ^n > 
i/i28 ■ (1 - 5 /i28). By the definition of z, we have k < g < z 
and r 2 < 8 / fe ^ 16 ■ C 1- *)/*. Hence, we have r z < 1/2 • r,-, for 

def 

all i > g. Define the vector x as x, = i/2m • fa - r z ) for 

def 

i £ S z and r,- = for i ( S z . Notice that: 

x r Lx = ( x < ~ x $ < '/2m • J] fa - r/ 

Fact[TT] „ 

< i/2m • 2^ Wi ~ Sjlla < 2y. 

Also, x > and yu(supp(x)) < b /s < 1/2, by the definition 
of z. Moreover, 



11 c <, 

J] ^,-X 2 = l/2„, ■ J] djfa - r z ) 2 < l/2m • J] ^ = 1. 

;=i ;=i 1=1 

and 

« z 
2^ dtxf = V2m ' ^ 4'fa - '"z) 2 
i=k 

g 

> yim-^diin-yi-n) 2 

i=k 



= l/2m- 1/4 ■ J] ^ 



> 1/512 • (1 - 5/128) > 1/1024 

Hence, by Lemma [XT] there exists a sweep cut S h with 
z> h>k, such that 0(5/,) < 2048 ■ -Jy. This shows that 



fi, as defined in Figure[6]exists. Moreover, it must be the 
case that S 1, Q B. As /; > k, we have 

k 

> Yj^i >Yj> d-Vl28)-(l- 5 /l28) > 1-V64. 



ieS* 



i=l 



Recall also that, by the construction of z, n(B) < b /&. 
Hence, we have 

V(a,B) = 78 • y - a-b)/b ■ n(B) ■ y > (7/g - i/s) • y > 3/4y. 

M(or,j8) • X > (1 ~ 1/64) ■ (1 - 3/ 64 )y - 7/8y - T > 1/64 • J. 

This completes all the three cases. Notice that in every case 
we have: 

i/2m • L - yL{K v ) < M(a,B) < i/2m • L + yL(K v ). 
Hence, 

-jL{K v ) < M(a,B) < 3L(K V ). 

Finally, using the fact that ftS;} !e y is embedded in (9(logn) 
dimensions, we can compute L • X in time (5(m). L{Kr) • X 
can also be computed in time 0(n) by using the decomposition 
%/>~ w ^ll»i - y ;ll2 = 2 ' E-^llf/ - u a vg s ll|, where u aV g fl is the 
mean of vectors representing vertices in R. The sweep cut over 
r takes time 0(m). Hence, the total running time is 0(m). 
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A Appendix 

A.l Proof of Corollary [Li] 

Lemma A. 1. // only the second condition in Theorem \1.1\ 
holds, then G has no Q,i,(l)-balanced cut of conductance 0(y). 

Proof. We may assume that the cut S output by BalCut is 
not Q/,(l)-balanced. Then, by the second condition, any cut 
T with vol(r) < 1/2 ■ vol(G) and <j>(T) < 0{y) must have 
voi(snr)/ vol(r) > i/ 2 . Hence, vol(T) < 2-vol(Snr) < 2-vol(S) < 
Qfe(l). This implies that there are no £2/,(l)-balanced cuts of 
conductance less than 0(y). 



Proof. [Proof of Lemma |2.l| For a ^-balanced cut (S,S) 
with <f>(S) < y. Without loss of generality, assume p:(S) < 
72. Consider the one-dimensional solution assigning y, = 
^f( s /fi(S) to i € S and v t - - y/f( s /fi(S) to / € S . Notice that 
y avg = and that - VjW^ = l /n(.SMS) for i e S , j i S . We then 
have: 



2 1 \E(S,S)\ ^ \E(S,S)\ 
E \\Vj - Vj\L - — • — < 2 ■ — 



<4-</>(S) <4y. 



Elk - Uavgllz = KS) ■eWUs)+»(S) .Pff)/„<S) = 1. 



for all i e V, 



KS) 1-6 



Vi - l>avgll 2 < — — < 



MS) b 

where the last inequality follows as S is ^-balanced. 



Proof. [Proof of Lemma 3.1 1 For all i e V, let yi = x?. By 
Cauchy-Schwarz, 

X \m - yj\ = 2j (x < ~ x ^ Xi + x ^ 

li.ME [i,j)eE, Xi>xj 



)|v{ij}€£ ) \{i,J}eE 



Hence, 



Y \yi~yj\ < a A ' 2 Y dix2 i- 

U,j)€E \ VieV 



< V2l. 



Then, let cf> be the conductance of the least conductance cut 
among S k+u S k+2 , ■ ■ -,S h . 

h h 

i=k+l i=k+\ 
h h 

> d'M' 1 = X = ^ X diX ^ = 

i=k+\ 

Hence, cf> < 7<r ■ \/2~I. 



i=k+l i=k+l 



ieV-S k 



A.l Proof of Basic Lemmata 



A.3 Primal-Dual Framework 



9 



A.3.1 Preliminaries Recall that S D is the subspace of R y and by the definition of P w , 
orthogonal to v - V V5m ■ D^ 2 \ and that I be the identity over 
S D . 

Define 

Ejm « exp(- £ y) 



\-j](M(a^) + yL(K v i) 



I • exp(-eF) 
The following simple facts will be useful. 

Fact A. 1. Let M be a symmetric matrix in R Vx,/ such that 
Mv = 0. Then, exp(M)B = v. 

Theorem A.l. Let s > and let Y {1 \ Y <T) be a sequence 
of symmetric matrices in R l/xV such that, for all i e [T], 
Yfi = 0and0<Yi<I. Then: 



j]Y (,) >((l-s)j]Y w »Z w - 

y=l V 



log n 



J, 



where Z' +1 = E e {^' i=x Y®). 

A.3.2 Proofs In the following, let M (,) d = 2m-ZT '^ZT 1/2 . 
Also, let W {t) = E E {y/r\ M (0 ) . Then, we have 



(t-\ 



Vi=l 



f'f-l 



2m • ZT I/2 £ ; 



/2 



Z M(i 

= 2m ■ D- ,/2 W (,) D- ,/2 . 



Proof. [Proof of Lemma [Z2| By Fact |2.4| 
UK V ) • X (,) = (D ,/2 ID ,/2 ) . (D- ,/2 W (,) D- ,/2 ) = J . W w = 1. 



We are now ready to complete the proof of Theorem 2.2 



Proof. [Proof of Theorem |2.2| Suppose that Oracle outputs 
coefficents (a w ,y8 w ) for T iterations. Then for all t e [T], by 
the definition of good oracle W w • M w 



= (i/2« • D ,/2 X (f) D' /2 )) . (2m • D- >/2 P (,) D-' /2 ) = X w . f* 

> i ■ (M(a w ,/? w ) . X w + yL(tfy) . X (,) ) 
> 1/6 ■ (1/64 ■ y + y) = 1/6 • 65 /64 ■ y. 

Notice that, as mr®,/J®J are the output of a good oracle, we 
have < P (,) < L(K V ), which implies < M w < I < I. 
Hence, we can apply Theorem | A. 1| to obtain: 



^M (,) > (l-e)^M w .W (,) - 



i=1 

This implies 

T 



log n 



I. 



^J(l- e ).r/^«. r ~W» 



> (1 - £) • T/6 ■ 65/64 • y - 



log n 



L(£ y ). 



Hence, 



M(or,/?) > i/r ■ I (1 - e) • T ■ 65/64 • y - -y-T] L(K V ) 



and 



M(a,J3)> (1 -e)- 65/ 64 . y- 



6 log n 



def def 

By picking e = Vi30 and T = 6 • 129 • V« ■ lo g"/r = O ( lo 8"/y) > 
we obtain M(or,jS) > ■ L(K V ). As M(a,jS)l = 0, this also 
implies 

M(a,f3) > 0. 

Finally, by the definition of good oracle, V(a,f3) > 3 / 4 ■ J- 
Hence, {a,f3) is a solution to dsdp(G,£>, 3 /is ■ j)- 

A.4 Projection Rounding The description of the rounding 
algorithm ProjRound is given in Figure [7] We remark that 
during the execution of BalCut the embedding {u, e R rf };ev 
will be represented by a projection over d — O (log n) random 
directions, so that it will suffice to take a balanced sweep cut 
of each coordinate vector. 



We now present the proof of Theorem 2. 1 The constants 



in this argument were not optimized to preserve the simplicity 
of the proof. 

A.4.1 Preliminaries. We will make use of the following 
simple facts. Recall that for y e R, sgn(j/) = 1 if y > 0, 
and -1 otherwise. 

Fact A.2. For all y,zeR,(y + z) 2 < 2(y 2 + z 2 ). 

Proof. 2(y 2 + z 2 ) - (y + z) 2 = (y - z? > 0. 

Fact A. 3. For all y > z e R, |sgn((/) • y 2 - sgn(z) • z 2 \ < 
(y-z)(\y\ + \z\). 

Proof. 

1. IfsgnO) = sgn(z),then|sgn(j/)-i/ 2 -sgn(z)-z 2 | = \y 2 -z 2 \ = 
(y-z)-\y + z\ = (y- z)(\y\ + \z\) as y > z. 

2. If sgn(i/) + sgn(i/), then since y>z,{y-z) = \y\ + \z\. 
Hence, \sgn(y) ■ y 2 - sgn(z) ■ z 2 \ = y 2 + z 2 < (\y\ + \z\) 2 = 
(y-z)(\y\ + \z\). 

Fact A.4. For all y> z e R, (y-z) 2 < 2(sgn(t/)-j/ 2 -sgn(z)-z 2 ). 



Proof. Part (2). 

E E(x,. - * avg ) 2 Fa « (1) E E|h - , avg || 2 round = bmty 
1. If sgnQ/) = sgn(z), (j/ - zf = y 2 + z 2 - 2yz < y 2 + " ^ " '~" 

z 2 - 2z 2 = y 2 - z 2 as y > z. Since sgn(i/) = sgn(z), Hence for some p2 be fixed later 
y 2 -z 2 < 2(sgn(i/) • j/ 2 - sgn(z) • z 2 ). 



2. If sgnQ/) * sgn(z), (*/ - z) 2 = (|y| + |z|) 2 < 2(|*/| 2 + |zf) 
2(sgn(t/) • y 2 - sgn(z) • z 2 ). Here, we have used Fact 



A.2 



E(X; - X aV g) 2 > 1/P2 ■ E||Oi - U aV g| 



P2- 



Part (3). Let R d = {/ e V : Ik - u avg || 2 < 32 • d-*)/*}. 

We also need the following standard facts. Hpf _ , 

Let cr = 4 ■ V2 V^W- By Markov's Inequality, p(R) < V^ 2 - 

Fact A. 5. Let v e R d be a vector of length I and u a unit As {y;};<=y is roundable, for all i, j e R, - vj\\ < 2cr. Hence, 

vector chosen uniformly at random in S f_1 . TTien, - uj\\ > V 2 °" ■ - "j\\ 2 f° r such i, j e 7?. This, together with 

the roundability of {t>;}; e y, implies that 

7. E„ (i) r w) = ^r,and 

V 7 E lit;,- - D J > i/mcr. 

2. /or < c5 < 1, P„ [ • < < 35. 

For any k e R, we can apply the triangle inequality for the 
Fact A.6. Let Y be a non-negative random variable such that Euclidean norm as follows 

P[7 < K] = 1 and E[Y] > 5. Then, . . 

E \\iH-Vj\\< E (|h - v k \\ + - B,||) 

P[F>*/2]>— . <2 - Elh-^l. 



The following lemma about projections will be crucial jj ence f or all £ e 7? 
in the proof of Theorem |2.1| It is a simple adaptation of an 

argument appearing in fl4f E ||u f - v k \\ > 7256a-. 

i~MR 

Lemma A.2. (Projection) Given a roundable embedding {v; e _ , „ , ,,„,,„. 

def r- Let ^* be the set ( !£S: ll^i - v k\\ > 75i2o-}. Since \\vi - v k \\ < 

R^hw, consider the embedding x e R" such that x t = W • 2 cr, applying Fact|A6]yields that, for all /t e 7?, 
M T u,-, where u e S , anrf assume without loss of generality 

that x\ > . . . > x„. Then, there exists c e (0, b] such that with P [i e 7?^] > '/i 0240-2 - 

probability £lt,(l) over the choice of u G S'' -1 , the following 

conditions hold simultaneously: p or a jj vertices / € R^, by Fact|A5 



7. B {iJ}eE ( Xi - xj) 2 < oJE^Ih - vj\\ 2 ) = O b (y), 2 

« 3 

2. E,w„(x,- - x avg ) 2 = Ob(i), and 

def def 

Let 5 = V 2 • 7 4 608o- = 1/92160-. Consider the event & — {i e 

3. f/zere e»'s/s 1 < / < n with vol({l, ...,/})> c-vol(G) and, A | x . _ > 2 • 5} Then 
f/iere exists I < r < n such that vol({r, . . . , n}) > c • vol(G) 

such that xi-x r > D. b {Y). P [£] = P [i s R k ] ■ V[\x t - x k \ > 2 • 6 \i € R k ] 

u, {i,k}~ft R Xfi B {i,k}~/i R xii R u 

Proof. We are going to lower bound the probability, over u, > \ 2 _ I drf 

of each of (1), (2) and (3) in the lemma and then apply the 1024cr 2 3 1536cr 2 

union bound. Hence, from Fact |A6] with probability at least P/2 over 

Part (1). By applying Fact|A5]to v = v t - Vj and noticing directions ^ for a fraction p/2 of pairs {/> fc) g fl X ^ \ Xk _ x .\ > 2 . 

• \v T u\ = \Xi - x:\, we have def 

J d. Let v be the median value of U,|, E y. Let L = {1 : x t < v - 0} 

def 

E E (jc«-Jt/) 2 = E ll^-^ll 2 . and H = {i-Xi>v + 6}. Any pair {i, j] e 7J x 7? with 

" ( i --'') s£ &fi eE |jCj - > 2 ■ 5 has at least one vertex in L U 77. Hence, 

Hence, by Markov's Inequality, for some Pl to be fixed later ^ ^ H)> ^ . pji . ^ {R)2 > p/4 . > p/[& = 

P 



E (xi-Xj) 2 >y Pl - E \\ui-Uj\\ 2 

{LfleE UJ)eE 



p x Assume /j.(L) > p/32, otherwise, apply the same argument to 

77. Let / be the largest index in L. For all i e L and j such that 
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Xj > v, we have \x, - xi\ > 6. (Similarly, let r be the smallest 
index in H.) This implies that, 



def 

Let Si = [I, . . . , i] and let <p be the minimum conductance of 
Si over all / < i < ,r 



\xi - x L „ /2j | > 6 

with probability at least p/2 = Q/,(l), satisfying the required 
condition. Let p^ be the probability that this event does not 
take place. Then, 

p3 < 1 - P/2. 

To conclude the proof, notice that the probability that all three 
conditions do not hold simultaneously is, by a union bound, 
at most pi + P2+P3- Setting p\-p2- p/s = £2/ 3 (l), we satisfy 
the first and third conditions and obtain 

Pi +Pz +P3 < 1 -p ■ 0/2 - Vs - l /5) < 1 - p/io. 

Hence, all conditions are satisfied at the same time with 
probability at least p/io = 0^,(1). 

From this proof, it is possible to see that the parameter c in 
our rounding scheme should be set to p/32. 

We are now ready to give a proof of Theorem |2.1| It is 
essentially a variation of the proof of Cheeger's Inequality, 
tailored to produce balanced cuts. 

Proof. [Proof of Theorem |2.1| For this proof, assume that x 
has been translated so that x avg = 0. Notice that the guarantees 



of A. 2 still apply. Let x, I, r and c be as promised by Lemma 



A. 2 For z e R, let sgn(z) be 1 if z > and -1 otherwise. Let 



yi = sga(Xi) ■ X; . 



Hence, 



FactlAJl 

E \yt-yj\ < E (|jc, - Xj \) ■ Qx t \ + \xj\) 

U,j)€E \uj\eE 



<JE (Xi-xj) 2 - E (\xi\ + \xj\) 2 



FactlO 



< 2 ■ E (Xi-Xi) 2 - E (x 2 +x z ) 

A/ (ijlefi " [i,j)€E K 1 J' 



I 2 in 

= -.12 • E (xj - X.) 1 ■ — • E Xj 2 
^ [i,j)€E m Hi 

= U ■ E ( Xi - x>) 2 ■ E Xi 2 

Lemma[AJl-(l),(2) 



o*(Vx)- 

Now we lower bound E^eg \y t - yf\. Notice that if x,- > Xj 
then yt > and vice- vers a. Hence, 

yi> ...>y„. 



1 

E \yi - yj\ = -T |£(Si,Sj)l • (j/i - i/i+i) 



[i,j}£E 



z 



min[vol(S,), vol(5,)} 



Lemma[A!2l-(3) 



|£| 



(«/; - i/i+i) 



• (p-(yi- y r ) 

FactlA4l 

> Q fc (l)-0-(x, -x,) 2 

Lemma [a!2| 

> Ofc(l) ■ 0. 

Hence, < <9/,( -Jy) with constant probability over the choice 
of projection vectors u. Repeating the projection O(logn) 
times and picking the best balanced cut found yields a high 
probability statement. Finally, as the embedding is in d 
dimensions, it takes 0{nd) time to compute the projection. 
After that, the one-dimensional embedding can be sorted in 
time 0(n) and the conductance of the relevant sweep cuts can 
be computed in time 0(m), so that the total running time is 
0(nd + m). 

A.5 Proof of Lemma [231 

A.5.1 Preliminaries For the rest of this section the norm 
notation will mean the norm in the subspace So- Hence 
||A|| = ||JAI~||. We will need the following lemmata. 

Lemma A. 3. (Johnson-Lindenstrauss) Given an embedding 
{vj € R"}, e y, V — [n], let U\,U%, ...,Mjt, be vectors sampled 
independently uniformly from the n — l-ditnensional sphere of 
radius yf^Jk. Let U be the k X t matrix having the vector u, as 

def def 

i-th row and let di = Uvi. Then, for kg — 0( lo &"/s 1 ), for all 
(1 - 5) ■ \\v t - Vj\\ 2 < |ft - Vj\\ 2 <(! + *)• \\vi - Vj\\ 2 



and 



(l - d) • INI 2 < INI 2 <(i + « -Ik 



Lemma A.4. (|9 |) There exists an algorithm EXPV which, on 
input of a matrix A e R" x ", a vector u e R" and a parameter 
7], computes a vector v G R", such that \\v — e u\\ < ||c~ A ||2 • t\ 
in time 0(t A log 3 (7'/))- 

The algorithm EXPV is described in g) and (8) . 
A.5.2 Proof We define the U s algorithm in Figure [8] and 



proceed to prove Lemma 2.3 



Proof. We verify that the conditions required hold. 



By construction, X > 0, as X = yz-B T B, and L(K V )»X = 
1. 

1 = (i/Vz • 5) r (VVz • B) and B is a k s x « matrix, with 



£,j = <9(log n), by Lemma A. 3 



We perform kg = (9(log«) calls to the algorithm EXPV, 
each of which takes time <9(?a) = 0(t M + n). Sampling 
the vectors {M/h,...^ and computing Z also requires 6{n) 
time. Hence, the total running time is 0(t M + ri). 

Let U be the kg x n matrix having the sampled vectors 
Mi, . . . , Uk s as rows. Let {u;}j e y be the embedding cor- 
responding to matrix F =' D^ 2 e^ A D^ 2 , i.e., i); is the 
;-th column of Y^ 2 . Notice that X — y /l(k v )»y. Define 

def 

Dj = Uvj for all i and let Y be the Gram matrix corre- 
sponding to this embedding, i.e., Y = f (Y^ 2 ) 1 U T U(Y^ 2 ). 
Also, let F be the Gram matrix corresponding to the em- 
bedding [vthev, i-e., Y = B T B and X = y/l(k v ).y. We will 
relate Y to F and Y to F to complete the proof. 



First, by Lemma A. 3 applied to {i>,}, e y, with high proba- 
bility, for all H 

(1-6)- L(H) • Y < L(H) • F < (1 + d) ■ L(H) • Y 

and for all z e V 

(1 - 5) Ri • F < • ¥ < (1 + 6) ■ Ri • K 

In particular, this implies that (1 — <5)-l"«F<X«F< 
(1 + 5) • I • F. Hence, 

1-5 ,1+6 

■ L(H) . X < L(#) . X < ■ L(H) • X 

1+6 1-6 

and for all i 

1-6 .1+6 

Rj*X<Ri»X<: Ri • X. 

1+6 1-6 

Now we relate F and F. Let £ d = (F I/2 - F i/2 )d 1/2 . By 
Lemma [A4l 



i 

< 2m • ||F 1/2 D 1/2 ||^ • ri 1 < (2m) 2 • LCSTy) •Y-rj L . 
This also implies 

ll^lb ' IIF 1/2 £> V2 ||2 < \\E\\ F • \\Y Vl D lll \\ F 

< ( V2^ • ||F 1/2 D 1/2 || 2 ■ 77) ■ J^di^t~f 

< 2m • 77 ■ (1 + 6) ■ L(K V ) • Y. 



As D ,/2 (F - F)d 1/2 = E T E + (P 2 ) T E + E T Y'> 2 , we h 

||D 1/2 (F-F)d i/2 || 2 

< \\E T E + (Y >/2 ) T E + E T Y' /2 \\ 2 

< V3(||£|i + 2-||£||2l|F 1/2 || 2 ) 



<9-(2m) 2 ■(\+6)-L(K v )»Y ■ rj. 



and 



\L(K V ) . (F - F)| 
< L(K V ) • (£ r £) + 2 ■ |L(£y) • (£ r F 1/2 )| 

< 1/2™ • ||£|| 2 + 2/2- ■ l|£|| f ||F 1/2 || F 

< 3 ■ 2m ■ (1 +6)-L(K v ) • Y-tj. 

Finally, combining these bounds we have 

\\x -x\\ 



Y 




F 




L(Ky) • 


F 


L(tfy) • 


F 


F 




F 




L(Ky) • 


F 


L(tfy) • 


Y 


F 




F 




L(tfy) . 


K 




Y 



||F|| 2 ■ \L(K V ) . F - L(K V ) . F| | ||F - F|| 2 



L(A" V ) • F • L(K V ) • F L(^ v ) • F 

2m ■ jLCgij • Y - L(K V ) . Y\ + \\Y - Y\\ 2 
L(K V ) • Y 
12 ■ (2m) 2 • (1 + 6) ■ L(K V ) • F • 77 
" (1 -5)-L(tfy)»F 

< 12 • (2m) 2 • i+Vi-tf ■ v 
< 0(Vpoiy(«)) 

by taking 77 sufficiently small in <9(7poiy(«)). 
Hence, as ||L(#)|| 2 < 0(m) and \\Ri\\ 2 < O(m) 



and 



\L(H) ■ X - L(H) ■ X\ < O(VpoiyW) 
\Ri mX-Ri»X\< O(VpolyW). 



This, together with the fact that is/i+s > 1 - 1 /m 
l+6 li-s < 1 + 764 completes the proof. 
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1. Input: An embedding {v t e R d }; e y, b e (0, 

2. Let c = be a constant to be fixed in the proof. 

3. Forf = l,2,...,0(logn): 

(a) Pick a unit vector w uniformly at random from S d ~ l and let jeR" with d = f Vd ■ u T Ui. 

def 

(b) Sort the vector x. Assume w.l.og. that x\ > xi > . . . > x„. Define 5,- = {j e [«] : Jt/ > x,}. 

(c) Let5 w = (St, Sd which minimizes 0(5,-) among sweep-cuts for which vol (5,-) e [c • 2m, (1 - c) • 2m]. 

4. Output: The cut S ® of least conductance over all choices of t . 

Figure 7: ProjRound 



Input: A matrix M e R" x ". 

def def 

Let i] = OCVpoiy&o). Let 5 = V 512 an d s = '/i30. 



For £,5 as in Lemma A. 3 sample kg vectors u\, . . . , Uk s e R" as in Lemma A. 3 



Let A d = (2m • s) ■ D^MD^ 1 . 
For 1 < z < ij, compute vectors e R", b t d = EXPV(i/2 ■ A, D _I/2 Wj, 77). 



def 



K .112 



Let B be the matrix having /?, as z'-th row, and let Dj be the z'-th column of B. Compute Z = E(/j-} e/JX/J ||y,- - = 
L(K V )»B T B. 



«• def 



Return X = V z ■ # by giving its correspoding embedding, i.e., { 7 Vz ■ z),}; € v 



Figure 8: The E e algorithm 



